Customer Number 
00909 



Application Serial No.: 10/042,192 
Attorney Docket No.: 042846-0312966 
Reply and Amendment Under 37 C.F.R. §1.111 



Listing of Claims 

Please replace all prior versions of claims with the following listing of 
clams: 

1 . (Currently Amended) A method for determining a language in which a 
document is created comprising the steps of: 

afreceiving at least one electronic documen t that includes a character 
string, wherein characters in the character string are represented in at least one of a 
plurality of character sets corresponding to an undetermined language ; 

b) ident i fy i ng at loast on e charact e r s e t encod i ng usod in tho at loast ono 
e l e ctron i c docum e nt; 

evaluating at least a portion of the character string by comparing each of 
the characters in the portion of the character string to a plurality of predetermined 
candidate character sets to determine one or more matches between the plurality of 
predetermined candidate character sets and the characters in the portion of the 
character string; 

c) -determining whether the at l e ast one or more character set s that match 
the characters in the portion of the character string correspond to one or more 
supported e ncoding i d e nt i fi e s a languages i n which th e el e ctron i c docum e nt is cr e at e d ; 
and 

d) i ndicating the identifying one or more supported languages in which the 
electronic document is created-if^ a based on a determination is mado that the at l oast 
one or more character set s that match the characters in the portion of the character 
string correspond to one or more supported oncod i ng i dont i fios tho languages i n wh i ch 
th e ele ctron i c docum e nt i s creat e d . 

2. (Currently Amended) The method of claim 1 , wherein the step of of 
determining includes determining d e t e rm i n e s that the at l e a s t one or more character 
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se ts that match the characters in the portion of the character string correspond to 
oncod i ng idontifios at least two supported pot e nt i a l languages associated with i n wh i ch 
the electronic document i s cr e at e d . 

3. (Currently Amended) The method of claim 2, further comprising the step of 
efcomparing at least one group of characters in the portion of the character string 
o l octron i c documont to predetermined groups of characters. 

4. (Currently Amended) The method of claim 3, further comprising the step of 
f)-detecting at least one identification for the at least one group of characters. 

5. (Original) The method of claim 3, wherein the at least one group of 
characters is an n-gram. 

6. (Original) The method of claim 4, wherein the at least one identification is a 
bit-flag. 

7. (Currently Amended) The method of claim 4, further comprising the step of 
§) logically ANDing the at least one identification. 

8. (Currently Amended) The method of claim 7, wherein the step of §}- 
logically ANDing the at least one identification is repeated until a single identification is 
determined. 



-4- 



Customer Number 
00909 



Application Serial No.: 10/042,192 
Attorney Docket No.: 042846-0312966 
Reply and Amendment Under 37 C.F.R. §1.111 



9. (Currently Amended) The method of claim 8, further comprising the step of 
indicating the supported language associated with i n wh i ch the electronic document 

i s creat e d . 

10. (Currently Amended) The method of claim 9, further comprising the step 
of ^identifying a character set associated with e ncod i ng for the supported language 
indicated. 

1 1 . (Currently Amended) A system for determining a language in which a 
document is created comprising: 

receiving means for receiving at least one electronic document that 
includes a character string, wherein characters in the character string can be 
represented in any of a plurality of character sets corresponding to an undetermined 
language ; 

evaluating means for evaluating at least a portion of the character string 
by comparing each of the characters in the portion of the character string to a plurality 
of predetermined candidate character sets to determine one or more matches between 
the plurality of predetermined candidate character sets and the characters in the portion 
of the character string i d e nt i fy i ng m e ans for i d e nt i fy i ng at le ast on e charact e r se t 
oncod i ng used i n the at l oast on e ele ctron i c documont ; 

determining means for determining whethe r tho at loast one or more 
character set s that match the characters in the character string correspond to one or 
more supported e ncoding i d e nt i fi e s a languages i n wh i ch th e ele ctronic docum e nt i s 
croatod ; and 

identifying indicat i ng means for i ndicat i ng th e identifying one or more 
supported languages in which the electronic document is created-fr a based on a 
determination i s mado that the at le ast one or more character sets that match the 
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characters in the portion of the character string correspond to one or more supported 
encoding i dont i fios tho languages i n wh i ch tho o l octron i c documont i s croatod . 

12. (Currently Amended) The system of claim 1 1 , wherein the determining 
means determines wh e th e r that the at l oast one or more character se ts that match the 
characters in the portion of the character string identify encod i ng i d e nt i f ie s at least two 
supported pot e nt i al languages associated with m wh i ch the electronic document \s- 
croatod . 

1 3. (Currently Amended) The system of claim 1 2, further comprising 
comparing means for comparing at least one group of characters in the portion of the 
character string e lectron i c docum e nt to predetermined groups of characters. 

14. (Original) The system of claim 13, further comprising detecting means for 
detecting at least one identification for the at least one group of characters. 

15. (Original) The system of claim 13, wherein the at least one group of 
characters is an n-gram. 

16. (Original) The system of claim 14, wherein the at least one identification is 
a bit-flag. 

17. (Original) The system of claim 14, further comprising logical ANDing 
means for logically ANDing the at least one identification. 
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18. (Original) The system of claim 17, wherein the logically ANDing means 
logically ANDs the at least one identification until a single identification is determined. 

19. (Currently Amended) The system of claim 18, further comprising language 
indicating means for indicating the supported language associated with in wh i ch the 
electronic document is created . 

20. (Currently Amended) The system of claim 19, further comprising character 
set e ncod i ng identifying means for identifying a character set associated with e ncod i ng 
fef the supported language indicated. 

21 . (Currently Amended) A system for determining a language in which a 
document is created comprising: 

a receiving module that receives at least one electronic document that 
includes a character string, wherein characters in the character string can be 
represented in any of a plurality of character sets corresponding to an undetermined 
language ; 

a character set identification module that evaluates at least a portion of 
the character string by comparing each of the characters in the portion of the character 
string to a plurality of predetermined candidate character sets to determine one or more 
matches between the plurality of predetermined candidate character sets and the 
characters in the portion of the character string an id e nt i fying modul e that id e ntif ie s at 
l oast ono charact e r set e ncoding us e d i n th e at le ast ono oloctron i c docum e nt ; 

a determining module that determines whether the at le ast one or more 
character set s that match the characters in the portion of the character string 
correspond to one or more supported oncod i ng idontif i os a languages i n wh i ch tho 
o l octron i c documont is cr e at e d ; and 
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an identifying i ndicating module that ind i catos the identifies one or more 
supported languages in which the electronic document is created-tf^ a based on a 
determination is mad o that the at l oast one or more character set s that match the 
characters in the character string correspond to one or more supported e ncod i ng 
idont i fios tho languages i n wh i ch tho el e ctron i c documont is croatod . 

22. (Currently Amended) The system of claim 21 , wherein the determining 
module determines wh e th e r that the at le ast one or more character set s that match the 
characters in the portion of the character string correspond to e ncod i ng i d e nt i fi e s at 
least two supported potent i a l languages associated with i n wh i ch the electronic 
document i s croatod . 

23. (Currently Amended) The system of claim 22, further comprising a 
comparing module that compares at least one group of characters in the portion of the 
character string o l octron i c documont to predetermined groups of characters. 

24. (Original) The system of claim 23, further comprising a detecting module 
that detects at least one identification for the at least one group of characters. 

25. (Original) The system of claim 23, wherein the at least one group of 
characters is an n-gram. 

26. (Original) The system of claim 24, wherein the at least one identification is 
a bit-flag. 

27. (Original) The system of claim 24, further comprising a logical ANDing 
module that logically ANDs the at least one identification. 
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28. (Original) The system of claim 27, wherein the logically ANDing module 
logically ANDs the at least one identification until a single identification is determined. 

29. (Currently Amended) The system of claim 28, further comprising a 
language indicating module that indicates the supported language associated with i fh 
wh i ch the electronic document i s croatod . 

30. (Currently Amended) The system of claim 29, further comprising a 
character set e ncod i ng identifying module that identifies a character set associated with 
e ncoding for the supported language indicated. 

31 . (Currently Amended) A processor readable medium comprising processor 
readable code that causes a processor to determine a language in which a document is 
created, the processor readable medium comprising: 

receiving code that causes a processor to receive at least one electronic 
document that includes a character string, wherein characters in the character string 
can be represented in any of a plurality of character sets corresponding to an 
undetermined language ; 

evaluating code that causes a processor to evaluate at least a portion of 
the character string by comparing each of the characters in the portion of the character 
string to a plurality of predetermined candidate character sets to determine one or more 
matches between the plurality of predetermined candidate character sets and the 
characters in the portion of the character string i d e nt i fying cod e that causes a proc e ssor 
to id e ntify at l east on e char a ct e r s e t e ncod i ng used i n th e at le ast on e e l e ctron i c 
docum e nt ; 



-9- 



Customer Number Application Serial No.: 10/042,192 

00909 Attorney Docket No.: 042846-0312966 

Reply and Amendment Under 37 C.F.R. §1.111 

determining code that causes a processor to determine whether th e at 
least one or more character set s that match the characters in the portion of the 
character string correspond to one or more supported oncod i ng idontif i os a languages 
i n wh i ch th e ele ctron i c docum e nt i s cr e at e d ; and 

identifying ind i cat i ng code that causes a processor to ind i cato tho identify 
one or more supported languages in which the electronic document is created-if-a. 
based on a determination i s mad e that the at le ast one or more character set s that 
match the characters in the portion of the character string correspond to one or more 
supported e ncod i ng i d e nt i f ie s th e languages i n which th e ele ctronic docum e nt i s 
croatod . 

32. (Currently Amended) The medium of claim 31 , wherein the determining 
code determines wh e ther that the at l east one or more character se ts that match the 
characters in the portion of the character string identify oncod i ng i dontif i es at least two 
supported pot e nt i al languages in wh i ch the electronic document i s cr e at e d . 

33. (Currently Amended) The medium of claim 32, further comprising 
comparing code that causes a processor to compare at least one group of characters in 
the portion of the character string el e ctronic documont to predetermined groups of 
characters. 

34. (Original) The medium of claim 33, further comprising detecting code that 
causes a processor to detect at least one identification for the at least one group of 
characters. 

35. (Original) The medium of claim 33, wherein the at least one group of 
characters is an n-gram. 
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36. (Original) The medium of claim 34, wherein the at least one identification is 
a bit- flag. 

37. (Original) The medium of claim 34, further comprising logical ANDing code 
that causes a processor to logically AND the at least one identification. 

38. (Original) The medium of claim 37, wherein the logically ANDing code 
logically ANDs the at least one identification until a single identification is determined. 

39. (Currently Amended) The medium of claim 38, further comprising 
language indicating code that causes a processor to indicate the supported language 
associated with i n wh i ch the electronic document is creat e d . 

40. (Currently Amended) The medium of claim 39, further comprising 
character set encod i ng identifying code that causes a processor to identify a character 
set associated with oncod i ng for the supported language indicated. 
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